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Abstract 

In all setups when the structure of UMVUEs is known, there exists a subalgebra 11 
(MVE-algebra) of the basic cr-algebra such that all U-measurable statistics with finite 
second moments are UMVUEs. ft is shown that MVE-algebras are, in a sense, similar 
to the subalgebras generated by complete sufficient statistics. Examples are given when 
these subalgebras differ, in these cases a new statistical structure arises. 
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1 Introduction 


Among C. R. Rao’s major contributions to the foundations of statistical inference (Cramer- 


Rao inequality, Rao-B 


ing observation in 


Rao 


ackwe’ 


lization, Rao’s score test, to name the best known), the follow- 


(119521 ) is not widely known. 
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Let (X, A, 7) be a standard statistical model with (X, A) a measurable space, and let 
7 = {Pg,0 G 0} be a family of probability distributions of a random element X G (X, .A) 
parameterized by a general parameter 9. 

A statistic g(A) with Eg\g(X)\ 2 < oo, 9 G 0 is called a UMVUE (uniformly minimum 
variance unbiased estimator), more precisely, the UMVUE of g(9) = Eg'g(X) if 


Var e (g(X)) < Var e (g(X)) , W9 G 0 


for any statistic g(X) (with finite second moment) with E e g(X) = Eg'g(X), 9 G 0. 

If gi (AT) and g 2 (A - ) are UMVUE’s, so is their linear combination c\(j\ (X)+c 2 g 2 (A), ci, c 2 G 
M. Rao observed that their product g(X) = g\ (X)g 2 (A") is also a UMVUE provided that 
E ( )[g (A) | 2 < oo, 9 G 0 (this is not guaranteed by Eg\gi (A) | 2 + E e \g 2 (A) | 2 < oo, 9 G 0). 

In all known (to the authors) setups, the class of UMVUE’s has the following structure: 
there exists a subalgebra (an MVE-algebra) AEA such that a statistic g{ A) is a UMVUE 
if and only if p (A) is A-measurable. Combining all this leads to a hypothesis that such is 
the structure of UMVUEs in general. The results in the paper are closely related to this 
hypothesis, which, if true, introduces a new statistical structure. 

In Section [2] a few setups with known structure of UMVUEs are presented. The first one 
(see Subsection 12.ip is classical, in a sense, of a family 7 possessing a complete sufficient 
statistic/subalgebra. The MVE-algebra is complete sufficient subalgebra and every estimable 
parametric function possesses a UMVUE. The other setups, when the minimal sufficient 
subalgebra is incomplete, are less known. Plainly, due to the Rao-Blackwell theorem, MVE- 
algebras are subalgebras of the minimal sufficient subalgebras. In Subsection 12.21 the setup 
of a partial complete sufficient subalgebra is considered, that formally generalizes the setup 


of Su bsecti on 12.11 ( 


paper 


Kagan et al. 


' or a re lation between partial completeness and completeness see a recent 


(2014)). In Subsection l2.3l a geometric construction leads to a description 


of the MVE-algebras in case of categorical X. Lehmann’s example presented in Subsection 
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I2.4l deals with discrete X taking countable many values and thus is not covered by the results 
of Subsection 12.31 

In Section [3] some results for arbitrary (not necessarily generated by complete sufficient 
statistics) MVE-algebras are proved, showing that the MVE-algebras preserve properties of 
complete sufficiency. 

le existence of UMVUEs in the Nile problem by R. Fisher 


(1197 3l)) and related problems attracted recen tly som e atte ntion (see 


see 


Fisher (1936 


For its partial solution see 


Kagan and Malinovs 


statistic, and sufficient subalgebras we refer to 


ky 

(2013) 

. For 

Bahadur 

(1957) 


Navak and S inha ( 2012 )). 


Fisher 


he relation between sufficient 


2 Setups with known structure of UMVUEs 

A statistic y(A") is called an unbiased estimator of zero (or zero mean statistic) if 

E eX (X) = 0 , V0G0. 

A well known result (see, e.g., L ehmann and Casellaj (119981) . p. 85) gives a necessary and 


sufficient conditions for a statistic g(X) to be a UMVUE. A statistic g(X) is a UMVUE if and 
only if it is uncorrelated with any zero mean statistic %(A) with Eg\x(X)\ 2 < oo, V 6 £ 0. 
Its immediate corollary is that the class of UMVUEs is a linear space: if r/) and g 2 are 
UMVUEs, so is 'g(X) = c{gi(X) + (>252 (A) for any constants ci, C 2 . 

In the following subsections, setups with known structure of the class of UMVUEs are 
presented. 
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2.1 Families with complete sufficient subalgebras/statistics 

Let a statistic S = S(X), S : (X,A) —> (S, CB) be a complete sufficient statistic for IP, i.e., 
Eq\ (S(X)) = 0, V 9 e 0 implies 


P g (y = 0) = 1 for all deQ. 


( 1 ) 


Equivalently, the subalgebra A = S'^ 1 (!B) of A is complete sufficient if any A- measurable 
Y with Eqx = 0 implies (P). The concept of completeness is due to Lehmann and Scheffe 
and its role in the estimation theory is explained by the following. If A (resp., S(X)) is 
a complete sufficient subalgebra (resp., statistic) for CP, then ^.-measurable statistics (resp., 
statistics depending on X only through S(X)) with finite second moment and only they are 
UMVUEs. 


Notice that the factorization theorem does not distinguish between complete and in¬ 


complete sufficiency. Prov ing (or disprov ing) 


an additional analysis. 


Lehmann and Scheffei (119501) (see also 


completeness o 


a su ffici ent statist ic r equire s 


L ehmann and Casella (119981) ) 


showed that for the natural exponential families of full rank, the minimal sufficient statistic 
is complete. 


2.2 A case of partial completeness 


The following interesting observation is due to 


Bondessonl ( 19831) . Let CP = {Pq^ , (9,rj) £ 


0 x 5} be a family of distributions on (X, A) parameterized by a bivariate parameter. Sup¬ 
pose that for any fixed r]* e 5, a subalgebra A (resp., a statistic S(X)) is complete sufficient 
for 9 (i.e., for the family CP* = {Po )V *, 9 e 0}). Then any yf-measurable statistic (resp., 
depending on X through S(X)) with finite variance is a UMVUE. The following models 
are covered by the Bondesson’s result. Suppose that the probability density function of 
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X = (Xi,X 2 ) is factorized as 


f(x i,x 2 ; 0 ,rj) = Ri(xi, 6 )R 2 {x 2 \ rj)r{x\, x 2 ). 


For any fixed 77 = 77 *, X 1 is sufficient for 6 . ff X 1 is complete sufficient, then any statistic 
~g(X 1 ) is a UMVUE. One can notice that, in general, the parametric function E$ jV g(Xi) = 
g(0,rj) depends on the who le parameter ( 6 , 17 ) and not only on the first component. The 


original example due to 


Bondesson (119831 ) is a sample X = (X 1; ... ,X n ) from a gamma 


population with density function 


c(a,^) 


f(x; 9,v) = - V) a 


37 — 77 

e _33 ‘, x > rj 


with (6*, 77) G M + x R as parameters, and a ,/3 (a > 0 ,a ^ 1,/f > 0 ) known. For any fixed 
77 = 77*, X is complete sufficient for 6 . Any statistic p(X) (with finite second moment) is the 
UMVUE for En v g(X) = g(0, 77 ). In particular, X is the UMVUE for Eg iV (X) = 77 + a/30. 


Actually, 


Bondesson! (119831) arguments prove a more general result. Namely, if A is an 


MVE-algebra for (P* (not necessarily a complete sufficient subalgebra) for any fixed 77 *, then A 
is an MVE-algebra for CP. Indeed, if x (V) is a zero-mean statistic with finite second moment 
for CP, it is zero-mean statistic for CP*. So that due to the assumption any .A-measurable ~g(X) 
with E d r/ *\'g(X )\ 2 < 00 one has 


Eo,rr{gx) = 0, V0G0. (2) 

Since (J2]) holds for any 77 * G 5, 'g is a UMVUE for CP. One can notice that the reverse is not 
true, in general. If 77 is a UMVUE for CP*, it is not necessarily a UMVUE for CP. The thing is 
that the relation Eq jV *x(X) = 0 for some 77 * G 5 and all 6 G O does not imply E 6 ri x(X) = 0 
for all 7 / G 5, 6 G O, so there are more zero mean statistics for CP* than for CP, in general. 


5 









2.3 UMVUEs from categorical data 


Let X be a categorical random variable whose values may be taken as 1,2 ,N. The 
distribution of X is given by 

P(X = k-,9)=p k (9), k — 1,..., N 

with d G © as a parameter. Plainly, only parametric functions from L = span {pi(0), p 2 (6 ),..., Pn(9)} 
are estimable, i.e., can be unbiasedly estimated. The set M = {pi(0) ,p 2 (0), • • • ,Pn( 0 )} can 
be partitioned into 

M = Mi U ... U M r 

in such a way that 

(i) the subspaces Li = span {M{\ ,..., L r — span {M r } are linearly independent, and 

(ii) the partition is maximal, i.e., if for some Z, Mi = M[ U M ", M[ CiM'i = 0 and (i) holds 
for the new partition, then either Mi = M[ or Mi = M". 

Such a partition is unique up to ordering. Without loss of generality, one may assume 

Mi = (pi(0), • • -iPkAO)} , M 2 = {p kl+ i(d),... ,p kl+k2 (d)} 

M r = {pk 1+ ...+k r - i+i(0), • • • ,Pn(0)} . (3) 

The partition (J3J) generates a partition of the set {1, 2 ,..., N}: 

I\ U / 2 U ... U I r = {1,..., fci} U {k\ + 1,..., k\ + /c 2 } U ... U {/ci + ... + k r —i + 1,..., iV} . 

( 4 ) 

A statistic 'g(X) is a UMVUE if and only if it is constant on the elements of (]Tj), 

g(x) = const = g v x e Ij, j = 1 ,..., r. 
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The subalgebra of UMVUEs is generated by sets (J4J. The class of parametric functions 
admitting UMVUEs is a linear subspace of L , 

span { 711 ( 0 ),...,7r r (0)}, nj(9) = ^2p k (9), j = l (5) 

kGlj 

Note that S(X) is the minimal sufficient statistic for 6 if and only if S(k) = S(l) is equivalent 
to 


Pk{0) = c k ipi(9), VO G 0 


( 6 ) 


for some constant c^i > 0. Due to (i), a pair (k, l) G {1, 2,..., N} with (fh| always b elon gs to 
the same element of the partition (|4j) . The above construction is due to iKagan and Konikov 


(120061 ). The following example with 


Pi(9) = 0, p 2 {9) = 0\ p 3 (9) = 0 + 6 2 , p A (0) = 1 - 2 0 - 2 0 2 , 0 e (0,1/4) 


illustrates the situation when the minimal sufficient statistic is trivial and incomplete while 
the subalgebra of UMVUEs is generated by two sets, {1,2,3} and {4}. Therefore only 
elements of span {1, 9 + 0 2 } possess UMVUEs. 


2.4 Lehmann’s example 


he fo llowing example is due to lhehmann and Scheffd (1195011 (see also lLehiriami and Casella 


( 19981 1. pp. 84-85). Let X G { — 1, 0,1, 2,...} with 

P e (X = -1) = 0, P e (. X = k) = { 1 - 0) 2 0\ 0 G (0, 1), k = 0,1, 2,... 


It is easy to see that all unbiased estimators of zero are of the form 


U(X)=aX for some a G M. 

If g{X) is a UMVUE, then 

E e {g(X)X} = 0, 0e (0,1) 
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so that X'g(X) is itself an unbiased estimator of zero. Thus, 


xg{x) = ax, x G { — 1, 0,1, 2,...} for some a G M, 


whence 


fj{x) = g(— 1 ) for all x ^ 0 , with an arbitrary g( 0 ). 


The subalgebra of UMVUEs in this example is generated by two sets, {0} and {—1,1,2,.. 
The parametric functions possessing UMVUEs are elements of span{l, (1 — 0) 2 }. 


3 Properties of MVE-algebras 

In this section some properties of MVE-algebras are presented. They are similar to properties 
of complete sufficient statistics/subalgebras and, in our opinion, are an argument in favor 
of that under rather general conditions on the statistical model (X, A, 7), there exists a 
maximal MVE-algebra A such that a statistic g(X) with E , e|g(X )| 2 < oo, V 6 G 0 is a 
UMVUE if and only if it is A- measurable. 

Recall that a subalgebra A C A is an MVE-algebra if any ^.-measurable statistic fj{X) with 
finite second moment is a UMVUE. 

Theorem 1 . If A\,A^ are MVE-algebras, so is A = cr(.Ai, A 2 ), the smallest a-algebra 
containing both A\ and A 2 (in other words, generated by A\ and A 2 ). 

In terms of statistics, if U\(X), f/ 2 (X) are such that any estimators gi(Ui(X)), < 72 (^(X)) 
with finite second moments are UMVUEs, so is g{U\{X), [^(X)) with finite second moment. 

Proof. For A\ G A\,A 2 G A 2 set hi(x) = l{ xG Ai},i = 1,2. Since h\{X) is a UMVUE, 
cov 0 (hi(X),x(X)) = E e {h 1 {X),x{X)) = 0,V 6 G 0 for any y(X) with E g (x{X)) = 
0, Eg\x(X)\ 2 < 00 V 6 G 0. Thus xi(X) = hi(X)x(X) is also a zero mean statistic. Since 


MX)|<1, U 0 |y(X)| 2 <oo, 


cov, (h 2 (X),xi(x)) = E e (/i 2 (X)/n(X)x(X)) =0, V 6 G 0 
and /ii(X)/i 2 (^) = l{xeAinA 2 } is a UMVUE. 

Let now An G yii , ^4 2 * G A 2 ,i = 1, ...,n with indicators hu(x) = l{ X GA li } ) h 2 i(x) = 
l{xeA 2 i}ji = 1,..., n. The above arguments prove that for any constants c 1? ... ,c n 

n 

J2cih u (X)h 2 i (X) (7) 

i =1 

is a UMVUE. As is well known, the functions 0 are dense in the Hilbert space L 2 e ^A^j of 
A-measurable functions h(X) with j \h(x) ; dPg(x) < oo. For an A-measurable statistic h(X) 
with finite second moment and given e > 0 take h{X) of the form (J7J) with Ee\h(X) —h(X )\ 2 < 
e 2 . Now, for any unbiased estimator of zero y(XC) with finite second moment, 

|cov e (h(X), X (V)) | = |cov e (h(V), X (X)) +cov, (h(X) - h(X), X (X)) | < e^E d |y(X)| 2 . 

Since, e > 0 is arbitrary, cov 0 (h(X), y(X)) = 0 and h(X) is a UMVUE. □ 

The next claim is an immediate corollary of Theorem 2. 

Corollary 1. Let |a 7 ,7 G T | be the collection of all MVE-algebras indexed, by ^ £ T. Then 
A = cr(A 7 , 7 6 T) is the maximal MVE-algebra. 

Turn now to the setup of combining independent data. Let (Xj, A;, CP*) be statistical 
models with CP* = { Pq 1 , Qi G 0*}, 7 = 1, 2. Set 

(X, A, CP) = (Xi x X 2 , Ai <g> A 2 , ?! x CP 2 ) 

is a statistical model with CP = {Pq = Pq 1 x Pq 2 } parameterized by a “bivariate” parameter 
0 = ($i, 0 2 ) G ©i x @ 2 = 0. 
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Theorem 2. If Ai C Ai is an MVE-algebra in the model (Xj, Ai, CPj), i = 1,2, then 
A = A\ ® A 2 is an MVE-algebra in the model (X, A. CP). 


In terms of statistics, let X t ~ P 9 ., i = 1, 2 and X = (Ad, AC 2 ) ~ P 9 . If statistics 
Ui(Xi), i — 1, 2 are such that any estimator gi(Ui(Xf)) with a finite second moment is a 
UMVUE of a parametric function gi(Qi),i = 1, 2, then any estimator g{U\{ Ad), U 2 (X 2 )) 
with finite second moment is a UMVUE of some g( 6 1 , 0 2 ). 

Proof. In view of Theorem 2, suffice to prove that A\ is an MVE-algebra in the model 
(X, A , CP). Let y(Ad, X 2 ) be a zero mean statistic with finite second moment so that 




|x(xi, x 2 )\dP 0 1 (x 1 )dPe 2 (x 2 ) < 00 , (ffi, 0 2 ) G 0. 


By Fubini theorem, for any 0 2 the function 


X( X V ^ 2 ) = / x(xi, x 2 )dPe 2 {x 2 ) 

Jx 2 

is well defined for Per -almost all X\ and is a zero mean statistic for the model (Xi, A±, CPi). 
Thus, for any A\ -measurable statistic U\{X\) with finite second moment, 


0 = cov(U 1 (X 1 ),x(X 1 - 1 6 2 ))= U 1 (x 1 )x(x 1 ;d 2 )dPe 1 (x 1 )= / U 1 (x 1 )x{x 1 ,x 2 )dP 9 l (x 1 )dP e2 

Jx 1 Jx 1 Jx 2 


and Ui(Xi) is a UMVUE in the combined model (X, A, CP). 


□ 


If in Theorem 3, the condition “Ai C Ai is an MVE-algebra for 9 l} i — 1, 2” is replaced 
with “Ai C Ai is a complete sufficient subalgebra”, then the claim may be replaced with 


“A 


111 


= Ar (g) A 2 i s a c omplete sufficient subalgebra for (ffi, 0 2 )”. This result was proved 


Landers and Rogge (119761) (where it is s 


;ated for arbitr ary subalgebras, n ot necessarily 


sufficient) strengthening previous results by 
some additional conditions on the models. 


P lachkv (119771 ) and 


Fraserl (119571) that required 


10 
















4 Comments 


In conclusion we would like to make some general comments on sufficiency and complete 
sufficiency. While sufficiency plays a fundamental role in all areas of statistical inference, 
complete sufficiency is tailored for the estimation. In a sense, while sufficiency is a statistical 
concept, completeness looks more like a mathematical-statistical tool. It seems that a more 
general concept of an MVE-algebra preserves the basic property of complete sufficiency in 
some setups when the (minimal) sufficient statistic is incomplete. 
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